On Delay-Optimal Scheduling in Queueing Systems with Replications

نویسندگان

  • Yin Sun
  • Can Emre Koksal
  • Ness B. Shroff
چکیده

In modern computer systems, long-running jobs are divided into a large number of short tasks and executed in parallel. Experience in practical systems suggests that task service times are highly random and the job service latency is bottlenecked by the slowest straggling task. One common solution for straggler mitigation is to replicate a task on multiple servers and wait for one replica of the task to finish early. The delay performance of replications depends heavily on the scheduling decisions of when to replicate, which servers to replicate on, and which task to serve first. So far, little is understood on how to optimize these scheduling decisions for minimizing the job service latency. In this paper, we present a comprehensive theoretical analysis on delay-optimal scheduling in queueing systems with replications. In particular, low-complexity replication policies are designed, and are rigorously proven to be delayoptimal or near delay-optimal among all non-preemptive and causal policies. These theoretical results are established for very general system settings and delay metrics which allow for arbitrary arrival process, arbitrary job sizes, arbitrary soft deadlines, and heterogeneous servers with data locality constraints. In order to prove these results, novel sufficient conditions are developed for sample-path delay optimality and near delay optimality, which can be applied to any queueing system and are not limited to the study of replications. Index Terms Queueing systems with replications, delay optimality, data locality, sample-path ordering, workefficiency ordering, stochastic ordering. Yin Sun and C. Emre Koksal are with the Department of Electrical and Computer Engineering, the Ohio State University, Columbus, OH. Email: [email protected], [email protected]. Ness B. Shroff is with the Departments of Electrical and Computer Engineering and Computer Science and Engineering, the Ohio State University, Columbus, OH. Email: [email protected].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Yin Sun

In modern computer systems, long-running jobs are divided into a large number of short tasks and executed in parallel. As the system size and complexity scales up, the task service times become highly random and the job service time is constrained by the slowest straggling tasks. An efficient solution is to replicate a task on multiple servers and wait for one task copy to finish early. However...

متن کامل

Incentive-Compatible Revenue Management in Queueing Systems: Optimal Strategic Delay and other Delaying Tactics

How should a capacity-constrained firm design an incentive-compatible price-scheduling mechanism to maximize revenues from a heterogeneous pool of time-sensitive customers with private information on their willingness to pay, time-sensitivity and processing requirement? We consider this question in the context of a queueing system that serves two customer types. We provide the following insight...

متن کامل

Delay-Optimal Scheduling for Queueing Systems with Switching Overhead

We study the scheduling polices for asymptotically optimal delay in queueing systems with switching overhead. Such systems consist of a single server that serves multiple queues, and some capacity is lost whenever the server switches to serve a different set of queues. The capacity loss due to this switching overhead can be significant in many emerging applications, and needs to be explicitly a...

متن کامل

Incentive-Compatible Revenue Management in Queueing Systems: Optimal Strategic Delay

H should a firm design a price/lead-time menu and scheduling policy to maximize revenues from heterogeneous time-sensitive customers with private information about their preferences? We consider this question for a queueing system with two customer types and provide the following results. First, we develop a novel problem formulation and solution method that combines the achievable region appro...

متن کامل

Near Delay-Optimal Scheduling of Batch Jobs in Multi-Server Systems

We study a class of scheduling problems, where each job is divided into a batch of unit-size tasks and these tasks can be executed in parallel on multiple servers with New-Better-than-Used (NBU) service time distributions. While many delay optimality results are available for single-server queueing systems, generalizing these results to the multi-server case has been challenging. This motivated...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1603.07322  شماره 

صفحات  -

تاریخ انتشار 2016